Fast Similarity Search of Multi-Dimensional Time Series via Segment Rotation
نویسندگان
چکیده
Multi-dimensional time series is playing an increasingly important role in the “big data” era, one noticeable representative being the pervasive trajectory data.Numerous applications ofmulti-dimensional time series all require to find similar time series of a given one, and regarding this purpose, Dynamic Time Warping (DTW) is the most widely used distance measure. Due to the high computation overhead of DTW, many lower bounding methods have been proposed to speed up similarity search. However, almost all the existing lower bounds are for general time series, which means they do not take advantage of the unique characteristics of higher dimensional time series. In this paper, we introduce a new lower bound for constrained DTW on multi-dimensional time series to achieve fast similarity search. The key observation is that when the time series is multi-dimensional, it can be rotated around the time axis, which helps to minimize the bounding envelope, thus improve the tightness, and in consequence the pruning power, of the lower bound. The experiment result on real world datasets demonstrates that our proposed method achieves faster similarity search than state-of-the-art techniques based on DTW.
منابع مشابه
Analytical D’Alembert Series Solution for Multi-Layered One-Dimensional Elastic Wave Propagation with the Use of General Dirichlet Series
A general initial-boundary value problem of one-dimensional transient wave propagation in a multi-layered elastic medium due to arbitrary boundary or interface excitations (either prescribed tractions or displacements) is considered. Laplace transformation technique is utilised and the Laplace transform inversion is facilitated via an unconventional method, where the expansion of complex-valued...
متن کاملPhysical Database Design for Efficient Time-Series Similarity Search
Similarity search in time-series databases finds such data sequences whose changing patterns are similar to that of a query sequence. For efficient processing, it normally employs a multi-dimensional index. In order to alleviate the well-known dimensionality curse, the previous methods for similarity search apply the Discrete Fourier Transform (DFT) to data sequences, and take only the first tw...
متن کاملMLR-Index: An Index Structure for Fast and Scalable Similarity Search in High Dimensions
High-dimensional indexing has been very popularly used for performing similarity search over various data types such as multimedia (audio/image/video) databases, document collections, time-series data, sensor data and scientific databases. Because of the curse of dimensionality, it is already known that well-known data structures like kd-tree, R-tree, and M-tree suffer in their performance over...
متن کاملOptimal Construction of Multi-Dimensional Indexes in Time-Series Databases: A Physical Database Design Approach
Similarity search in time-series databases is an operation that finds such data sequences whose changing patterns are similar to that of a query sequence. Typically, it hires the multi-dimensional index for its efficient processing. In order to alleviate the dimensionality curse, a problem in high-dimensional cases, the previous methods for similarity search apply the Discrete Fourier Transform...
متن کاملRelational Implementation of Multi-dimensional Indexes for Time Series
Similarity search over time-series data is a useful, but expensive, application. Sequence data can be transformed via an orthonormal transformation, which is then dimensionally reduced, to be indexed. Rtrees and R*-trees have been used for this purpose. These do not scale well, however, for very large datasets. We propose a new indexing data-structure within relational tables, indexed with B+-t...
متن کامل